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‘ fossil’ markers to trace back the speciation time 


of two sibling fig wasp species 
LOU Wen-Jing'” , TENG Wen-Jia’” , LIU Hai-Yang'” , LI Zi’ , 
XIAO Jin-Hua’, WEI Zhong-Yuan* , HUANG Da-Wei'”’* 


(1. College of Life Sciences, Hebei University, Baoding, Hebei 071002, China; 2. Key Laboratory of 
Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China) 


Abstract: Nuclear mitochondrial pseudogenes ( NUMTs ) are DNA fragments transferred from 
mitochondria to the nucleus. They are a double-edged sword for phylogenetic analyses because of their 
independent evolutionary histories. Herein, we employed a targeted, PCR-based procedure to study 
NUMTs originated from a Nad1-12S fragment in two sibling fig wasps associated with Ficus hispida: 
Philotrypesis pilosa and Philotrypesis sp. These sibling fig wasps arising from sympatric speciation live in 
the same syconium, which makes them good models to investigate subtle behavioral and genetic 
divergences in similar ecological niches. Further investigation relies on a correct estimate of the separation 
time of both species. Through the analysis based on the acquired NUMTs, we found that the origin of the 
NUMTs in both species appears to be a very recent event; however they may arise in the common ancestor 
of both species. Since the NUMTs integration is a recent event, we assumed that these newly produced 
NUMTs evolve similarly to genuine mtDNA. Then we calculated the genetic divergence timing based on 
mitochondrial genes, an average substitution rate of about 2. 3 x 10° substitutions/site/year. These 
typical characters make them good ‘ fossil’ markers to trace back the speciation time of both species back 
to 0.40 -0.48 mya. The results suggest that some NUMTs are good ‘fossil’ markers for tracing back 
important evolutionary events including speciation. 
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1 INTRODUCTION 


Nuclear mitochondrial pseudogenes ( NUMTs ) 
DNA that have 
transferred to the nucleus and represent “ molecular 


2001 ). PCR 


amplification and sequencing of mitochondrial genes 


are copies of mitochondrial 


fossils” ( Bensasson et al., 
from total genomic DNA is susceptible to the 
contamination of corresponding NUMTs. They evolve 
as nuclear pseudogene sequences and have lower 
evolutionary rates than their mitochondrial 
counterparts ( Zischler et al., 1995; Lin and 
Danforth, 2004). The improper use of NUMTs as 
mitochondrial obscure or even 
undermine phylogenetic inference ( Arctander, 
1995; Zischler et al., 1995; Zhang and Hewitt, 
1996 ), although sometimes they may be useful 
outgroups ( Zischler et al., 1995). Since their first 
discovery (Du Buy and Riley, 1967) , NUMTs have 


markers may 


been found ubiquitously in the nuclear genomes of 


many eukaryotic organisms, including plants, 
yeasts, alveolates, nematodes, insects, and 
vertebrates ( Adams et al., 2000; Bensasson et al., 
2000, 2001; Henze and Martin, 2001; Hazkani- 
Covo et al., 2010). 

Two fig wasp species, Philotrypesis sp. and P. 
pilosa ( Hymenoptera; Chalcidoidea) are associated 
with the fig tree Ficus hispida (Moraceae) (Jiang et 
al., 2006 ). 


characters and 


The evidence from morphological 
molecular phylogenetic 
reconstructions supports that they are sibling species 
arising from sympatric speciation (Jiang et al., 


2006). Since both in the 


ecological niche (the same fig syconium), it is 


species live same 


meaningful to compare subtle behavioral and genetic 
divergences between them ever since speciation. 
However, the origin date of the two species remains 
NUMTs have lower 


unknown. Generally, 
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evolutionary rate than mitochondrial genes. Since 
NUMTs integration is a recent event, NUMTs might 
have not stayed in the nucleus sufficiently long 
enough to ‘perform’ completely as nucleic 
fragments. It is assumed that these newly produced 
NUMTs evolve similarly to genuine mtDNA 
(Thalmann, 2005). As ancient events, NUMTs are 
fossils of the mtDNA and can help dating molecular 
( Thalmann et al., 

Previous studies of other hymenopteran 
species indicate that the 
sequence region encompassing partial sequences of 
the mitochondrial genes Nad1 and 12S ( Nad1-L1- 
16S-A-12S ) is a hotspot of NUMTs production 
( Behura, 2007). In the present study , we amplified 


the NUMTs originated from this region and made use 


divergence between species 
2005 ). 


insect mitochondrial 


of them to date the speciation of the two fig wasps. 


2 MATERIALS AND METHODS 


2.1 Fig wasps and DNA extraction 

Specimens were collected from Danzhou (19° 
30'’N, 109°31’E), Hainan province, China. The 
wasps were deposited in 95% ethanol and stored at 


— 20°C for DNA extraction. 


Before DNA extraction, every wasp was washed 
with 70% ethanol and then with sterile water several 
Fifteen 
individual wasps were used to extract total genomic 
DNA using the EasyPure Genomic DNA Extraction 
Kit ( TransGen, China ). 


encoding cytochrome c oxidase subunit | ( COI) 


times to remove surface contaminations. 


Beijing, The gene 
was used as reference to evaluate DNA quality. 


Negative samples were excluded from further 
analysis. 
2.2 DNA amplification and sequencing 

We used universal primers N1-J11876 and SR- 
N14745 (Simon et al., 2006) to amplify sequences 
in Philotrypesis sp. We then modified the primers for 
amplification of P. pilosa. All primers are listed in 
Table 1. PCR cycling conditions were 94°C for 4 
min, followed by 35 cycles of 30 s at 94°C, 30 s at 
the optimal annealing temperature ( T, ) (Table 1) 
and 1 min at 65%C , and a final elongation at 65°C for 
10 min. Sterile water served as the template for 
negative control. PCR products were purified by 
EasyPure Quick Gel Extraction Kit ( TransGen, 
Beijing, China), and then cloned with pEASY-T1 


vector ( TransGen, Beijing, China). Positive clones 
were sequenced by Invitrogen ( Beijing, China). 


Table 1 The list of primers for amplification of Nad1-12S 


Species Primer sequences (5’ —3’) 


Philotrypesis sp. 


P. pilosa 


F; CGAGGTAAAGTMCCWCGAACYCA 
R: GTGCCAGCAGYYGCGGTTANAC 


F; CGAGGAAAAGTACCCCGAACTCA 
R; GTGCCAGCATCAGCGGTTATAC 


Product size (bp) 


TCC) 
mtDNA NUMTs 
55 2 807 805 
56 2 852 802 


Primers were designed according to Simon et al. (2006). T, represents the optimal annealing temperature. 


2.3 Sequence analysis 
We applied 
NUMTs from their mitochondrial origins. 


several criteria to distinguish 
These 
included the existence of premature in-frame stop 
codons, indels, a high amount of point mutations 
and false phylogenetic relationships (Sorenson and 
Quinn, 1998; Song et al., 2008). 

We aligned multiple sequences utilizing the 
Clustal W implemented in the MEGA 4.0 with the 
default parameters (Tamura et al., 2007). Gathered 
data involved indels and the shift of open reading 
frames (ORFs). 

Neighbor-joining (NJ) trees were constructed 
from Kimura-2-parameter ( K-2-P ) distances 
(Tamura et al., 2007) using MEGA 4. 0. Nodal 
support was estimated using bootstrap analysis with 
1 000 replications. 

Genetic diversity was estimated by the number 


of segregating sites (S), haplotype number (H), 
haplotype diversity (Hd) and nucleotide diversity of 
per site (Pi) as implemented in DnaSP v5 ( Librado 
and Rozas, 2009). The existence and strength of 
selection was calculated using DnaSP v5. This 
synonymous and 


involved the number of 


nonsynonymous substitutions, the number of 


replacement substitutions per replacement site 
(Ka), the number of synonymous substitutions per 
synonymous site (Ks), the average value of all 
comparisons of Ka | Pi(a) |, and the average value 
of all comparisons of Ks | Pi (s)] (Lynch and 
Crease, 1990). Tajima’ s D, Fu and Li’ s D and Fu 
and Li’s F were calculated in DnaSP v5 to test if, 
according to their frequency spectrum of genetic 
sequences 


neutral evolution (Fu and Li, 1993; Aris-Brosou 


and Excoffier, 1996 ). 


diversity, the referred conformed to 


To identify the purifying 


56 & 


550 ERIR Acta Entomologica Sinica 


selection and population expansion, we analyzed the 
population size changes with DnaSP v5 to further 
estimate population trends. A unimodal distribution 
was assumed to indicate a population bottleneck or 
expansion, while a ragged multimodal mismatch 
distribution implied the demographic equilibrium of 
the population. 
2.4 Estimation of divergence time 

To investigate the divergence time between the 
authentic mitochondrial sequences and the NUMT 
sequences, we analyzed the NUMT sequences and 
the authentic mtDNA sequences found in both 
species ( Philotrypesis sp. and Philotrypesis pilosa) . 
The relative rate test (Tajima, 1993) suggested the 
application of one single substitution rate for all 
sequences. The divergence time was estimated using 
the equation, 6 = (2A)7T, in which 6 is the 
sequence divergence between the sequences, A is the 
substitution rate and T is the divergence time 


(Thalmann et al., 2005). 


3 RESULTS 


3.1 Highly conserved NUMTs and pre-mature 
stop codons in both species 

The expected length of mitochondrial Nad1-12S 
segment amplified with our primers in Philotrypesis 


sp. is 2 807 bp and in P. pilosa is 2 852 bp ( Xiao et 


m 
0.05 


Fig. 1 


71 


al., 2011). We obtained 8 and 11 NUMT sequences 
and P. pilosa ( GenBank 
accession numbers KC193191 - KC193209 ), 
respectively, with the length of 805 bp and 802 bp, 
which indicated an extremely long ( about 2 kb) 
deletion in the NUMTs when compared to the 
mitochondrial sequences. Phylogenetic positions 
(Fig. 1) further confirmed that they are NUMTSs. 
Each of the NUMTs sequences in both species is 
divided into two parts with the 400th site as the 
dividing line; the first half belongs to Nad1, and the 
second half belongs to 725. We tried to translate the 
part potentially encoding Nad1 and detected that; 1) 
the translated amino acid sequences are highly 
the NUMTs and 


mitochondrial gene; 2) an in-frame stop codon exists 


from Philotrypesis sp. 


conserved among genuine 
at the same site for all the amino acid sequences 
suggesting that these 
NUMTs arose in the common ancestor of both species 
(Fig. 2). 
3.2 Diversity and selection pressures on NUMTs 
We analyzed the extent of polymorphism among 
the NUMT sequences in both the whole fragments 
and the parts corresponding to Nad1 and 12S (Table 
2). In all treatments the indicators of polymorphism , 


S, H, Hd, and Pi, of NUMTs in P. pilosa were 
higher than those in Philotrypesis sp. However, 


obtained in both species, 


differences in sample numbers (11 sequences from 


N1-12S PpK1-NUMT4 
N1-12S PpK1-NUMT5 
N1-12S PpK1-NUMT3 
N1-12S PpK1-NUMT1 
N1-12S PpK2-NUMT1 
N1-12S PpK2-NUMT2 
N1-12S PpK1-NUMT2 
N1-12S PpK1-NUMT6 
N1-12S PpK2-NUMT4 
N1-12S PspK1-NUMT1 
N1-12S PspK1-NUMT2 
N1-12S PspK1-NUMT4 
N1-12S PspK2-NUMT2 
N1-12S PspK2-NUMT3 
N1-12S Psp mtDNA 
N1-12S Pp mtDNA 
N1-12S Ab mtDNA 




















42 


N1-12S Pp NUMTs 
41 


89 


N1-12S Psp NUMTs 


N-J tree based on Nad1-12S fragment and corresponding NUMTs in Philotrypesis pilosa and P. sp. 


mtDNAs of A. bakeri, P. sp. and P. pilosa are shown as N1-12S Ab mtDNA, N1-12S Psp mtDNA and N1-12S Pp mtDNA, respectively. NUMTs of P. 
sp. are labeled as N1-12S Psp NUMT and those of P. pilosa are shown as N1-12 Pp NUMTs. The haplotypes of NUMTs in P. sp. and P. pilosa assemble 


in different clades. The bar indicates sequence divergence of 5%. 
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N1-12S5 Pp mtDNA AEGE SELVSGE MS 
N1-12S PpK1i-NUMT1 AEGE SEL VSGEF MS 
N1-12S PpK1-NUMT2 AEGE SELVSGE MS 
N1-12S PpK1-NUMT3 AEGE SEL VSGE MS 
N1-12S PpK1-NUMT4 AEGE SELVSGE MS 
N1-12S PpK1-NUMT5 AEGE SEL VSGE MS 
N1-12S PpK1-NUMT6 AEGE SEL VSGE MS 
N1-12S PpK2-NUMT1 AEGE SEL VSGE MS 
N1-12S PpK2-NUMT2 AEGE SEL VSGE MS 
N1-12S PpK2-NUMT3 AEGE SEL VSGE {S$ 
N1-125 PpK2-NUMT4 REGE SEL YSGE be 
N1-12S PpK2-NUMT5 AEGE SELVSGE {9 
N1-12S Psp mtDNA AEGE SEL VSGEF {9 
N1-12S PspK1i-NUMT1 AEGESELVSGEF MS 
N1-12S PspK1i-NUMT2 AEGESELYVSGEF MS 
N1-12S PspK1i-NUMT3 AEGESELVSGF MS 
N1-12S Pspki-NUMT4 AEGESELVSGF MD 
N1-12S PspK1i-NUMT5 AaEGESELVSGE MS 
N1-12S PspkK2-NUMT1 AEGESELYVSGF {S$ 
N1-12S PspkK2-NUMT2 AEGESELVSGF MS 
N1-125 PsapK2-NUNT3 AEGESELVSGE MS 
N1-125 PspkK2-NUMT4 AaEGESELVSGF MS 
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LFAMIFISEY@ MPI LENLETMGGNIF SILFYLMYLFLL 
LFAMIFISEY@M MPIOYLENLESMGGNIYSILFYLMYHYLL 
LFAMIFISEYQO MPP LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEY@0 MPV LENLESMGGNIYSILFYLMYAYLL 
LFEFAMIFISEYQO MPO LENLESMGGNIY SI *FYLMYAYLL 
LFAMIFISEYQO MPO LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEYQO MPO LENLESMGGNIYSILFYLMYHYLL 
LFAMIFISEY@0 MPI LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEY@0 MPO LENLESMGGNIY SILFYLMYAYLL 
LFAMIFISEYQO MPO LENLESMGGNIYSILFYLMYAYLL 


MSGLEAMIFISEYG iH MPIggvryLENLISMGGNIYSILFYLMYYWYLL 


LFAMIFISEY@0 MPO LENLESMGGNIY SILFYLMYAYLL 


LFAMIFISEY@0 MPO LENLESMGGNIYNILFYLMYAYLL 
LFAMIFISEYQO MPO LENLESMGGNIY SILFYLMYAYLL 
LEFAMIFISEY@01 MPO LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEY@0 MPO LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEY@M MPO’ LENLESMGGNIYSILFYLMYHYLL 
LFAMIFISEYQO MPO LENLESMGGNIYSILFYLMYAYLL 
LFAMIFISEYG@AM MOY LENLESMGGNIYSILFYLMYHYLL 
LFAMIFISEY@0 MPO LENLESMGGNIYSILFYLMYAYLL 


GLEFAMIFISEY@ MO MiMawvwLENLISMGGNIYSILFYLMYNYLL 


LFAMIFISEY@M MPO LENLESMGGNIC SILFYLMYAYLL 


Fig. 2 Stop codon ( * in the frame) in Nad1-12S NUMTs of Philotrypesis sp. and P. pilosa 
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Nad1 fragments and corresponding NUMTs sequences are translated with the invertebrate mitochondrial code table. The amino acid sequences of both 


mtDNA and NUMTs in Philotrypesis sp. and P. pilosa are highly conserved. 


P. pilosa vs. 8 from Philotrypesis sp.) precluded 
evidence for higher diversity in P. pilosa. Within taxa, 
the diversity of NUMT Nad1 (Pi: 0. 00313) was 
higher than that of NUMT 12S ( Pi: 0. 00248) in 
Philotrypesis sp., while the diversity of NUMT 12S 
(Pi: 0.00591) was higher than that of NUMT Nad1 
(Pi: 0.00409) in P. pilosa. 

All indicators of neutral selection were 
significantly negative (Table 3). NUMTs Nad1-/2S in 
Philotrypesis sp. and P. pilosa were not experiencing 
neutral evolution, perhaps because of either purifying 
selection or population expansion (Fu and Li, 1993; 
1996 ). Mismatch 


distribution analyses indicated multimodal distributions 


Aris-Brosou and _ Excoffier, 
(Fig. 3), which ruled out population expansion. 
Selection pressure estimated by Pi (a)/Pi(s) for 
NUMT Nad1 (Table 3) showed purifying selection in 


both Philotrypesis sp. and P. pilosa. 
3.3 Estimation of speciation time 

We calculated the genetic divergence time 
based on mitochondrial genes, an average 
substitution rate of about 2. 3 x 107" substitutions/ 
site + year in the mtDNA for arthropods ( Brower, 
1994). Based on the NUMT representing Nad1, the 
average substitution/site | Pi(s) | was 0.01102 in 
Philotrypesis sp. and 0.00915 for P. pilosa ( Table 
3). We calculated the divergence time of the NUMT 
in P. sp. as 0.48 mya, and 0.40 mya in P. pilosa. 
Considering that the emergence of these NUMTs 
occurred before the separation of the species, we 
thus estimated that these sibling species are very 
young, having an origin later than 0.40 —0.48 mya. 


Table 2 Analyses of polymorphism on Nad1-12S NUMTs in Philotrypesis sp. and P. pilosa 


Species NUMTs segment N S 
Philotrypesis sp. Nad1-12S 8 8 
Nad1 5 

128 4 

P. pilosa Nad1-12S 11 22 
Nad1 9 

128 13 


H Hd Pi 


0.786 (0.151) 0.00249 (0.00091 ) 


4 0.643 (0. 184) 0.00313 (0.00118 ) 
3 0.464 (0.200) 0.00248 (0.00136) 
9 0.945 (0. 066) 0.00500 (0.00137 ) 
6 0.727 (0. 144) 0.00409 (0.00129 ) 
6 0.727 (0. 144) 0.00591 (0.00206 ) 


N; The number of NUMTs; S; Segregating sites; H; Haplotype number; Hd: Haplotype diversity; Pi: Nucleotide diversity of per site. 
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Table 3 Analysis of molecular evolution on Nad1-12S NUMTs in Philotrypesis sp. and P. pilosa 


Species NUMTs segment Tajima’s D Fu and Li’sD Fu and Li’s F Pi(a) Pi(s) Pi(a)/Pi(s) 
Philotrypesis sp. Nad1-12S -~1.70118*  -1.85903* -2.01588 * NA NA NA 
Nad1 ~ 1.59524 ~ 1.73524 ~ 1.87594 0. 00263 0.01102 0.239 
12S ~ 1.53470 ~ 1. 66525 ~ 1.79736 NA NA NA 
P. pilosa Nad1-12S -2.13077* «== 9, $0031" -2.81061 * NA NA NA 
Nad1 ~1.96045*  -2.36231" -2.55641 *™ 0. 00285 0.00915 0.311 
125 -2.04601 *  -=2.47723*™* -2.67974 * NA NA NA 


Pi(a): The average value of all comparisons of Ka; Pi(s): The average value of all comparisons of Ks. * ; P<0.05; ™:; P <0. 02. 


Frequency 





Pairwise differences 


Frequency 





0 5 10 15 20 25 


Pairwise differences 


Fig. 3 Mismatch distribution analysis of Nad1-12S NUMTs in Philotrypesis sp. (A) and P. pilosa (B) 


The observed curves with many peaks or free of waves mean equilibrium populations , whereas unimodal curves represent population expansion. Multimodal 


distributions in both species rule out the population expansion. Exp; Expected distribution; Obs; Observed distribution. 


4 DISCUSSION AND CONCLUSION 


The amino acid sequences of all NUMT 
sequences in both species are highly conserved with 
respect to their 
sequences ( Fig. 2). 
selection, the origin of the NUMTs appears to be a 


genuine mitochondrial protein 
In addition to purifying 


very recent event. 

In addition to providing us with knowledge 
unobtainable from contemporary samples, NUMTs 
verified here shed light on evolutionary history of 
species through dating molecular divergences. In this 
case study, the divergence time of the NUMT in 
Philotrypesis sp. is 0.48 mya, which is before that of 
P. pilosa. 

Due to the limits of the primers we designed, 
we could not sequence all NUMT WNad1-/2S 
Regardless, our further 
elucidated the channel of inheritance between the 


nuclear and mitochondrial DNA. Some NUMTs are 


good ° fossil’ markers for tracing back important 


fragments. analyses 


evolutionary events including speciation. 
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Ze i a (x et A FAVED FG HES 
Fes /)\ RE OB tok AH AY AAE BF TB) 


EH, ROE, SUE, EF, Ree, Mew, BAD 


C1. WAGE ree Be, WERE 071002; 2. PRA bh Worse, PRA bah WUE -S ASCH EAk, JE 100101 ) 





HE: BORAGE (nuclear mitochondrial pseudogenes, NUMTs ) 45 FHAE PRAY ZR A SEL E EI ANHE AN 
H DNA FEE. EFEMERA, NUMTs CEE RSA TTAB LI, RIEF PCR PY 
ETRE T XIRA Ficus hispida E RPH Philotrypesis pilosa #1) Philotrypesis sp. PREI FRAIA Nad1-12S F 
EKI NUMTs, AARP E h RARE RIE, EE EE BM SE ERRA), BE 
FY VA FAVE 1B FB Ft de ESE EAD) AE AS EB Bb AARRE FEE, LETRA EAB F 
Xt PI apo EKS ER. ET TRY) NUMTs HEAT HEE OT, RIEM: 1) 7x26 NUMTs #84 Rei 45 | 
IGE A SF; 2) NUMTs 5| ASF AA TED OME ZA o h FX NUMTs 5AA TB] fet ee, Si BE ER DE 
Z ES ORT ARE A READ, TAT TY BS BR ARE BY SE Sy a ER PR TRE Ay 2.3 x 10S R/T + OE AB LEE 
AG SERIE FY R BBR AT TRG IR AE DBS ZB oP ET EE M E 0. 40 -0.48 ANEAN o BGR GEA, — HEA AL 
ABE FY VA 4B RF BY FAVE ot TAEA He HE oT E BE PE BX 
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